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DIGITAL SIGNAL PROCESSING TECHNIQUES FOR IMPROVING 
AUDIO CLARITY AND INTELLIGIBILITY 

RELATED APPLICATION DATA 

The present appHcation is a continuation-in-part application of U.S. Patent 

AppUcation No. 09/669,069 for TECHNIQUES FOR IMPROVING AUDIO CLARITY 

AND INTELLIGIBILITY AT REDUCED BIT RATES OVER A DIGITAL 

NETWORK filed on September 22, 2000, the entire disclosure of which is incorporated 

herein by reference for all purposes. The present application also claims priority from 

U.S. Provisional Applications No. 60/289,936 for DIGITAL SIGNAL PROCESSING 

TECHNIQUES FOR IMPROVING AUDIO CLARITY AND INTELLIGIBILITY filed 

on May 9, 2001, and No. 60/293,684 for DIGITAL SIGNAL PROCESSING 

TECHNIQUES FOR IMPROVING AUDIO CLARITY AND INTELLIGIBILITY filed 

on May 25, 2001, and U.S. Patent AppUcation 09/927,578 for DIGITAL SIGNAL 

PROCESSING TECHNIQUES FOR IMPROVING AUDIO CLARITY AND 

INTELLIGIBILITY filed August 6, 2001, the entire disclosures of which are 

incorporated herein by reference for all purposes. 

BACKGROUND OF THE INVENTION 

The present invention relates generally to digital signal processing, and more 
specifically to the processing of digital audio signals in a variety of contexts. 

At one point, the growth of the Intemet was doubling every 18 months, vrith over 
57 million Domain hosts as of July 1999. In the United States, over half of the 
population now has access to the Intemet. This rapid development, in addition to the 
concurrent evolution of a variety of other content delivery mechanisms, e.g., digital 
broadcasting, cable and satellite system, etc., has fueled the explosive development of 
the digital audio industry. However, the quahty of audio delivered by these various 
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mechanisms is often linaited by the low bit rate encoding schemes employed to deliver 
the audio, e.g., the MPEG layer 3 (MPS) encoding scheme. 

Radio stations, concerts, speeches and lectures are all delivered over the web in 
streaming form. Encoders such as Ihose offered by Microsoft and Real Audio reside on 
5 servers that deliver the audio stream at multiple bit rates over various types of 

connections (modem, Tl, DSL, ISDN etc.) to a listener's computer. Upon receipt, the 
streamed data is decoded by a player, e.g., RealPlayer software, that understands the 
particular encoding format. Similarly, cable and satellite television systems deliver 
streaming video and audio to set top boxes in users' homes which decode and playback 
10 the encoded content. 

Audio files (e.g., MP3 files) may also be downloaded over the Intemet for 
storage and later playback using any of a variety of mechanisms including, for example, 
the hstener's computer or any of a variety of available portable playback devices. 

Regardless of the mechanism by which digital audio is delivered to the listener, 
15 there are a number of issues relating generally to the clarity and intelligibility of 
reproduced audio from the listener's perspective. These issues relate to any type of 
system which reproduces acoustic signals from digitally encoded information, e.g., 
portable music players, home entertainment systems, etc. 

By way of example, in the encoding process of a typical low bit rate encoding 
20 scheme, e.g., the MP3 encoding scheme, undesirable artifacts are generated which 

interfere with the goal of faithfully reproducing a relatively high bandwidth signal (i.e., 
the original audio) using a low bandwidth technique (i.e., the low bit rate codec). 

Such artifacts may be dealt with, at least in part, by appropriate processing of the 
analog or digital audio signals at their source (e.g., by the digital audio broadcaster). 
25 This is typically accomplished using a variety of techniques involving expensive 
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hardware, software techniques with a high computational overhead, or both. 
Unfortunately, these costly techniques only deal with half of the equation. 

That is, the ranges of listening environments, music types, and listener 
preferences make it virtually impossible to provide sigaal processing at the digital audio 

5 source which appropriately enhances the listening experience for each end user. This is 
exacerbated in systems in which the loudness level across the variety of available content 
is inconsistent. The processing capabilities which would enable customization according 
to each user's preferences may, of course, be included in the user's device. However, the 
cost of doing so in either hardware or processing resources has heretofore been 

10 prohibitive, not to mention technically challenging. This is particular true for the low 
cost, portable devices consumers demand. 

It is therefore desirable to provide digital signal processing techniques which 
remove undesirable artifacts generated by digital encoding techniques (particularly low 
bit rate techniques), allow for customization of each listener's experience, and present a 

15 relatively small load on the processing resources of the audio delivery system. 



3 
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SUMMARY OF TEIE INVENTION 
According to the present invention, a variety of digital signal processor 
configurations are enabled which may be flexibly configured to enhance the clarity and 
intelligibility of digital audio. Regardless of the eacoding scheme employed, the 
delivery mechanism, the nature of the listening envkonment, or the preferences of the 
listener, the digital signal processors of the present invention may be configured to effect 
processing of the digital audio in a manner which enhances the listener's experience and 
imposes an acceptable level of computational overhead. 

That is, the present invention provides methods and apparatus for effecting multi- 
band processing of an original sampled signal. The original sampled signal is separated 
into a plurality of signal components each corresponding to one of a pliuraUty of 
fi-equency bands. The dynamic range associated with each one of the plurality of signal 
components is independently and dynamically controlled. At least one signal level 

associated with the pluraUty of signal components is modified. The signal components 

I 

are combined into a processed sampled signal. 

A further understanding of the nature and advantages of the present invention 
may be realized by reference to the remaining portions of the specification and the 
drawings. 
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BRIEF DESCRIPTION OF THE DRAWINGS 
Figs, la and lb show a simplified block diagram of a signal processor designed 

according to a specific embodiment of the present invention. 

Fig. 2 is a simplified block diagram of various stages of a multi-band crossover 

for use with various specific embodiments of the present invention. 

Fig. 3 is a flowchart illustrating operation of a crossover stage in the multi-band 

crossover of Fig. 2. 

Fig. 4 is a flowchart illustrating operation of an automatic gain control processing 
block according to a specific embodiment of the invention. 

Fig. 5 is a flowchart illustrating operation of a nonlinear automatic gain control 
processing block according to a specific embodiment of the invention. 

Fig. 6 is a block diagram illustrating the playing of audio files over a network 
according to a specific embodiment of the present invention. 

Fig. 7 is a block diagram illustrating tiie decoding of audio files according to a 
specific embodiment of the invention. 

Fig. 8 is a block diagram illustrating the playing of audio files over a network 
according to another specific embodiment of the present invention. 

Figs. 9a and 9b show a simplified block diagram of a signal processor designed 
according to another specific embodiment of the present uivention. 

Figs. 1 Oa and 1 Ob show a simplified block diagram of a signal processor designed 
according to yet another specific embodiment of the present invention. 

Fig. 1 1 is a simplified block diagram of a signal processor designed according to 
a fiuther specific embodiment of the present invention. 
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Figs. 12a and 12b are block diagrams illustrating the transmission and receiving 
sides of a digital audio broadcasting system according to a specific embodiment of the 
invention. 

Fig. 13 is a block diagram illustrating a satellite television system according to a 
5 specific embodiment of the present invention. 

Fig. 14 is a block diagram of a home entertairanent system designed according to 
a specific embodiment of the invention. 

Fig. 15 shows a 3-band signal processor designed according to another specific 
embodiment which may be employed in voice or telephony applications. 



6 
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DETAILED DESCRIPTION OF SPECIFIC EMBODIMENTS 
Referring now to Figs, la and lb, a block diagram of a signal processor 30 is 
shown for processing audio signals according to a specific embodiment of the present 
invention. In this embodiment, signal processor 30 is implemented entirely in software 
5 and may be incorporated, for example, within a server distributing digital audio files or 
streaming audio, or within any of a variety of other devices including, for example, 
digital radio transmitters and receivers, standard PCs, cell phones, personal digital 
assistants (PDAs), wireless application devices, portable playback devices, set top boxes, 
etc. 

10 The input block 32 in Fig. la receives audio signals fi-om an audio source (not 

shown). The input block 32 converts the audio signals into pulse code modulated (PCM) 
samples according to any of a wide variety of well known digital encoding schemes. 
Subsequently, at the firequency shaping block 34, the very low fi-equency components of 
the PCM samples are elitninated which may otherwise degrade the audio quality of the 

15 samples. According to a specific embodiment, block 34 is a high pass filter (e.g., 5 Hz) 
which removes the DC offset. 

At the 2-band crossover block 36 the audio samples are separated into two 
partially overlapping firequency bands. According to a specific embodiment, all of the 
crossover blocks in processor 30 have a relatively shallow characteristic so that each 

20 band blends nicely Avith adjacent bands. Each frequency band is subsequently processed 
at non-linear automatic gain control (AGC) loop blocks 38 and 40 which, according to a 
specific embodiment, have less aggressive attack and release times than subsequent 
AGCs and are primarily for putting the signal level into the "sweet spot" of the 
subsequent multi-band crossover block 44. 



7 
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In the non-linear AGC loops 38 and 40 each of the input samples is multiphed by 
a number known as the gain factor. Depending on whether the gain factor is greater or 
lower than 1.0, the volume of the mput sample is either increased or decreased for the 
purpose of equalizing the amplitude of the input samples in each of the frequency bands. 
5 The gain factor is variable for different input samples as described in more detail below. 
The distinguishing factor between a non-linear AGC and an AGC is that the gain factor 
varies according to a nonlinear mathematical fimction in the non-linear AGC. Thus, the 
output of each of the non-linear AGCs 38 and 40 is the product of the input sample and 
the gain factor. According to a specific embodiment, AGCs 38 and 40 operate m a 

1 0 manner sunilar to that described below with reference to AGC 48 in processing block 60 
of Fig. lb. The outputs of the two non-lmear AGCs are mixed at the mixer block 42 so 
that in the resvdting output all the frequencies are represented. 

At the next block, multi-band crossover 44, the audio samples are divided into n 
overlapping freqxxency bands, where n = 3 or more. For a 5-band processor the bands 

1 5 may mclude, for example, sub-bass, mid-bass, mid-range, presence, and treble. Multi- 
band crossover 44 behaves very similar to 2-band crossover 36 except that the former 
has more frequency bands. 

Because the samples are divided into multiple frequency bands, the volume in 
each frequency band may be equalized separately and independently from the other 

20 frequency bands. Independent processing of each frequency band is desirable where 
there is a combination of high-pitch, low-pitch and medium-pitch instruments playing 
simultaneously. In the presence of a high-pitch sound, such as crash of a symbol that is 
louder than any other instrument for a fraction of a second, a single band AGC would 
reduce the amplitude of the entire sample mcluding the low and medium frequency 

25 components present in the sample that may have originated from a vocalist or a bass. 
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The result is a degradation of audio quaHty and introduction of undesirable artifacts into 
the music. A one band AGC would allow the component of frequency with the highest 
volume to control the entire sample, a phenomenon referred to as spectral gain 
intermodulation. 

Referring now to Fig. lb, each frequency band is independently processed by 
processing blocks 60, 62, and 64. Processing block 60 is dedicated to processing band 1 
with components possessing the lowest frequency. Drive block 46 is a user 
programmable gain adjustment which uniformly exaggerates the signal component as it 
goes into AGC 48 which works to reduce changes in the gain. For every Nth sample that 
doesn't overshoot its threshold, AGC 48 incrementally increases the gain. Likewise, for 
every Nth sample which does overshoot the threshold, AGC 48 incrementally decreases 
the gain. 

Drive block 50 is another user programmable gain adjustment which precedes 
negative attack time limiter (NATL) 52. Drive block 50 works in concert with inverse 
drive block 54 to adjust the effective range of operation of NATL 52. For some signal 
transients which occur quickly. AGC 48 may not react quickly enough and some 
overshooting samples would go otherwise go untreated resulting in a sharp overshoot at 
the begimiing of the transient. To deal with this, NATL 52 looks at future samples and 
limits the gain of the current sample to avoid the distortion associated with such sharp 
overshoots. In practical terms, the lower the threshold is set, the more "dense" the sound 
becomes. 

According to a specific embodiment of NATL 52, samples are stored in a delay 
buffer so that the future samples may be used in equaUzing the volume. When the buffer 
is full, a small block of earlier samples is extracted from the beginning of the buffer and 
the future block of samples is appended to the end of the buffer. The future sample is 
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multiplied by the gain factor. If the resulting data has an ampUtude greater than a 
threshold value (a user-fixed parameter) the gain factor is reduced to a value equal to the 
threshold value divided by the ampUtude of the future sample. A counter referred to as 
the release counter is subsequently set equal to the length of the delay buffer. Hie 

5 resulting data are then passed through a low-pass filter so as to smooth out any abrupt 
changes m the gain that wiU have resulted firom multiplication by the fiihire sample. 

Finally, the sample in the buffer which has been delayed is multiplied by the gain 
factor described above in order to produce the output. Subsequently, the release counter 
is decremented. If the release counter is less than zero, tiie gain factor is multiphed by a 

10 number slightly greater than 1 .0. Finally, the next sample is read and the above process 
is repeated. NATL 52 ensures that the transition firom the present sample to the future 
sample is achieved m a smooth and inaudible fashion, and removes peaks on the audio 
signal that waste bandwidth. 

According to a specific 5-band audio implementation of processor 30, processing 

15 block 60 may include a soft clip block 56 which corresponds to a nonlinear fimction 

which essentially rounds off the waveform, creating harmonics which, in tum, create the 
effect that there is more bass than there is in the input signal. That is, within an output 
signal excursion which is less than the peak-to-peak excursion of the input signal fi:om 
drive block 54 there is substantially more acoustic energy. 

20 The level mixer block 58 is another gain control wherein the sample is multiphed 

by a constant gain factor that may be preset by the user. Remixing of the signal 
components in the different firequency bands is performed at the mixer block 66. 
Another user programmable gain control 68 for general loudness is followed by a final 
NATL 70 which limits the total peak of the combined bands in the same way as 

25 discussed above with reference to NATL 52. The Imiitmg fimction performed by NATL 

10 
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70 is desirable, for example, where constructive interference between peaks in different 
bands causes peaks which need to be dealt with. Finally, the output of signal processor 
30 in the form processed audio samples is transmitted via output block 72. 

Fig. 2 shows the four stages of a 5-band crossover block 80 which may be 
employed as a specific embodiment of multi-band crossover 44 of Fig. la. Crossover 
block 80 represents a series of linear operations to separate signals into overlapping 
frequency bands. At each stage of the multi-band crossover 80 (as shown in Fig. 3) a 
computation is performed resulting in a high pass output as shown in the loop 90. More 
specifically, at each stage corresponding to a particular frequency band only the output 
from the previous stage, referred to as the high pass output, is read. An averaging 
process is then performed wherein the weighted sum of the previous stage's output and 
the new sample is computed. 

The output of the averaging process is referred to as the low-pass output in Figs. 
2 and 3. Thus, there are n-1 low pass outputs corresponding to the n frequency bands. 
The difference between the input sample and the low pass output is denoted as the high 
pass output which forms the input to the next stage of the multi-band crossover. Fig. 2 
shows four stages corresponding to the 1^, 2"'', 3"''', and 4* stages of the multi-band 
crossover labeled 82-88, respectively. 

Fig. 4 shows a flowchart illustrating operation of a specific embodiment of an 
AGC loop 98 which may be employed, for example, to implement AGC 48 of Fig. lb. 
AGC loop 98 appUes a gain factor to each sample it receives. Initially the gain factor is 
assumed and thereafter for each sample, as indicated at 92, the gain factor is increased 
sHghtly through multiplication by a number greater than 0.0 referred to herein as the 
release rate parameter. In this way, the gain factor increases with every sample. Every 
input sample is multiplied by the gain factor thus obtained, as indicated at 94. 

11 
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At 96 it is determiaed if the amplitude of the sample with the gain factor applied 
exceeds a preset threshold value. Li the event the threshold value is exceeded, the gain 
factor is reduced slightly through multiplication by a number greater than 0.0 referred to 
herera as the attack rate parameter. Otherwise the gain factor remains unaltered and the 
process repeats by reading a new input sample. 

Fig. 5 shows a flowchart illustrating operation of a specific embodiment of a 
special AGC loop 100 which maybe employed, for example, to implement AGC 38 of 
Fig. lb. The non-linear AGC loop 100 apphes a gain factor to each sample it receives. 
At 102, the gain factor is increased for every sample by multiplying the gain factor with 
a number shglitly greater 1.0, i.e., the release rate parameter. At 104, a trial 
multiphcation is performed by multiplying each input sample with the gain factor. If the 
amplitude of the resulting signal is greater than a preset threshold value, the gain factor is 
reduced slightly by multiplication with a number slightly less than 1.0, i.e., the attack 
rate parameter. The gain factor is then modified according to a nonhnear function. 

According to one embodiment of the present invention, the new gain factor is 
obtained by dividing the old gain factor by two and adding a fixed value to the outcome, 
thereby obtaining a nonlinear variation in the gain factor. The final output of the non- 
linear AGC loop 100 is obtained by multiplying each input sample by the modified gain 
factor. Thereafter, the process is repeated for the incoming new input samples. 

Various embodiments of the present invention are implemented entirely in 
software. In one embodiment, a Pentium processor within a standard PC is programmed 
in assembly language to perform the generaUzed signal processing depicted in Figs, la 
and lb, resulting in considerable reduction in both expense and complexity. 
Furthermore, the present invention is implemented in real-time, makmg it particularly 
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desirable for use in the transimssion of audio signals over any digital network such as the 
Internet. 

Fig. 6 depicts one application of the present invention wherein audio files are 
played over a digital network with dynamic processing optimization. Fig. 6 shows a 
5 communication system 120 comprising an audio server 106, a digital network 1 1 0, a PC 
1 14 and speakers 1 1 8. Audio server 106 is coupled to the digital network 110 through 
transmission line 108, which may be a Tl hne. Digital network 1 10 is coupled to the PC 
1 14 through the transmission line 1 12 and the PC 1 14 is coupled to the speakers 118 
through the line 116. 

10 Within the audio server 1 06, which may be a PC or several connected PC's, are 

several blocks for the processing of audio signals. The audio files 122 stored on a disk 
may be encoded using any of a variety of encoding algorithms such as, for example, the 
MPS encoding scheme. The audio files are played at 124 using a decoding software, 
e.g., Winamp, and are subsequently converted to PCM samples. The PCM samples are 

1 5 then processed by the signal processing software 126, embodiments of which are 
described herein, e.g., the processor of Figs, la and lb. 

The output of the signal processing software 126 is encoded again using any 
desired encoding algorithm, e.g., MPS, and is transmitted through the line 108, across the 
digital network 1 1 0, and through the line 1 12 to the PC 1 14. Inside the PC 1 14, 

20 equipped with the appropriate decoding software such as Winamp, the samples are 

decoded and converted into audio signals which are then fed to the speakers 1 1 8 through 
the line 116. 

Fig. 7 shows another generalized application of the present invention wherein a 
user is playing audio files stored in a digital audio playback device 130. Speaker 134 is 
25 coupled to playback device 130 through the line 132. Playback device 130 may 
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comprise, for example, any of a wide variety of consumer electronic devices which 
would benefit firom the signal processing innovations of the present invention such as a 
personal computer, any component of a home entertainment system, a handheld 
communication device, a portable CD or MPS player, etc. For example, playback device 
5 130 might be part of an audio system located inside a user's car, the dynamic processing 
capabilities of the invention being employed to improve the quality of sound in the 
presence of the background noise typical in such an environment. 

Audio files 136, encoded using any of a variety of encoding techniques, are 
decoded by decoding software 138 (e.g., Winamp) and are converted to PCM samples. 

10 The PCM samples are processed by signal processing software 140 designed according 
to any of the various embodiments of the present invention. 

It should be noted that signal processing software 140 may employ a greater or 
fewer number of firequency bands and processing blocks than various ones of the 
embodiments described herein. That is, for different applications, a greater or lesser 

15 amount of processing resources are available to effect the signal processing techniques of 
the present invention. For example, the available number of processing cycles in a small 
portable playback device such as an MPS player may be limited. By contrast, such 
limitations may not exist for an audio server such as server 106 of Fig. 6. 

The output of signal processing software 140 is finally converted to audio signals 

20 at conversion block 142 (which, in a PC, may be a sound card) which drives speakers 
134 via line 132. 

Fig. 8 shows yet another application of the present invention wherein the signal 
processing techniques described herein are employed at the receiving end of a network 
communication system. Shown in Fig. 8 is a communication system 170 including an 
25 audio server 150, a digital network 154, a PC 158, and speakers 162. The audio server 
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150 is coupled to the digital network 154 through the transmission line 152, the digital 
network 154 is coupled to the PC 158 through the transniission line 156, and the PC 158 
is linked to the speakers 162 through the line 160. 

The audio server 150 in this case may or may not include signal processing 
software designed according to any of tiie embodiments of the present invention. 
Encoded PCM samples are transmitted from the audio server 150 through the 
transmission line 152, across the digital network 154 and through the transmission line 
156 to the PC 1 58. Inside the PC 158, the PCM samples are decoded at 164 using the 
appropriate decoding software. The decoded PCM samples are processed by signal 
processing software 166. The output of the signal processing software 166 is converted 
into audio signals by the sound card driver 168 which drives speakers 162 via line 160. 

The AGC and NATL blocks used in the various embodiments of the present 
invention are quite similar with the differences being largely due to the adjustment of 
time constants, i.e., the attack and release times, for different implementations and for 
different effects within the same implementation. That is, a particular desired sound 
might affect the attack and release times specified for specific blocks. In addition, 
available processing resources might affect the number of bands and/or blocks per band 
in a particular implementation, e.g., a small cycle budget in an MP3 player vs. a large 
cycle budget in a music file server. 

As the bandwidth of encoders are reduced relative to the bandwidth of the 
original audio, undesirable audible artifacts are generated. The present invention 
processes the audio samples such that these anticipated artifacts become less noticeable 
to the human ear. That is, the signal processing of the present invention allows a low bit 
rate encoder to be used to encode an audio stfeam without suffering overly much from 
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the undesirable artifacts created by trying to faithfully reproduce a high bandwidth signal 
(the original audio) with a low bandwidth system (the low bit rate codec). 

In addition to facilitating the bandwidth savings represented by low bit rate 
encoders, the signal processing of the present invention may have other desirable effects 
5 such as, for example, the improvement of clarity in the presence of background noise and 
cut-to-cut evenness. 

A generalized topology of the present invention includes three dijfferent kinds of 
blocks, AGCs (including NATLs), drive blocks (e.g., drive blocks 46, 50 and 54 of Fig. 
lb), and filter blocks (e.g., crossovers 36 and 44 of Fig. la). Signal processing networks 
10 combining these three elements in any of a wide variety of ways are considered within 
the scope of the invention. As described above, filter or crossover blocks typically are 
employed to perform a series of linear operations to separate signals into overlapping 
frequency bands. 

In general, the AGC blocks of the present invention examine the recent history 
15 and/or immediate future of the signal and use this information to adjust a gain factor such 
that the si^al is kept within a range of peak excursion. Different implementations of 
such blocks in various embodiments differ as to how much of the signal is used to make 
these adjustments, and how fast or how often the adjustments are made. Also specified 
is the range of signals desired to be maintained at the output e.g., use of a threshold to act 
20 or not act in, for example, a NATL. In addition, once the gain value to be applied is 

determined, a further nonlinear fimction may be applied to die gain value before applying 
it to the current sample. Finally, the gain value may also be calculated wiUh. reference to 
the input signal level. Both feed forward and feed back AGC topologies may be 
employed according to various embodiments of the invention. There are two 
25 fundamental types of AGCs employed by the various embodiments of the invention, 1) 
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the limiter type (e.g., NATL 52 of Fig. lb), and 2) the dynamic range control type (e.g., 
AGC48ofFig. lb). 

Drive blocks are simply preset level controls for putting samples in the sweet spot 
for subsequent processing bIock(s). Putting the processing block(s) between a drive 
5 block and an inverse drive block allows the processing block(s) to operate within its 
normal range while moving the effective raage relative to the audio signal. 

According to a specific embodiment, the efficiency with which the fondamentai 
blocks of the signal processors of the present invention operate relates in part to the use 
of low-precision integer arithmetic to implement the blocks' functions. According to a 
10 more specific embodiment, separation of the work of the AGC and the NATL into two 
independent stages also contributes to efficiency and sound quality. 

Additional embodiments of the present invention will now be described with 
reference to Figs. 9a and 9b and subsequent figures. Figs. 9a and 9b show a 5-band 
signal processor 900 designed according to a specific embodiment of the present 
15 invention. It should be noted that the processing blocks of processor 900 operate in a 
similar manner to the corresponding blocks of processor 30 described above with 
reference to Figs, la and lb. It should also be understood that processor 900 may be 
employed for a wide variety of apphcations, particularly those application which have 
sufficient processing overhead to accommodate the associated computational load 
20 presented by this configuration. 

Referring now to Fig. 9a, the received digital audio samples are high pass filtered 
in filter block 902 to suppress the DC component and other unnecessary signal 
components below 5 Hz. The filtered samples are then pre-processed in one of four 
parallel paths referred to herein as the "transparent," "dual brick wall," "wideband," and 
25 "brick wall" paths, respectively. 

17 
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According to a specific embodiment of the invention, the "transparent" path 
divides the audio into two bands (bass and master) and processes them individually (with 
the bass band coupled to the master band). This can be thought of as a standard mode 
having neghgible effect. The "dual brick wall" path is the same as the "transparent" path 
5 except that it is more audible in its gain changes. The "wideband" path processes the 
full-range audio with only one AGC. This provides slight spectral gain intermodulation 
which, in some embodiments, is exploited by the certain presets (e.g., rock presets). The 
"brick wall" path is like the "wideband" path but provides considerable spectral gain 
intermodulation which, according to various embodiments, may be exploited by certain 
10 presets (e.g., so called club or house presets). 

The pre-processed audio is then divided into five firequency bands using 2-way 
crossover blocks 952-955 having cutoff frequencies of 80 Hz, 200 Hz, 2 kHz, and 8 kHz, 
respectively. This may be accomplished, for example, as described above with reference 
to the multi-band crossover of Fig. 3. The samples in each of Bands 1-5 are then 
15 subjected to further processing as follows. 

Noisegate blocks 961-965 remove components of the audio signal that are below 
a certain level of amplitude. Delay blocks 956-960 are used by noisegate blocks 961-965 
for look-ahead/negative attack time. 

Drive blocks 966-970 represent user programmable gain adjustments which 
20 uniformly exaggerate the received signal component as it goes into the following AGC 
block (i.e., 971-975) which works to reduce changes in the gain. According to a specific 
embodiment, for every nth sample that doesn't overshoot its threshold, each of AGC 
blocks 971-975 incrementally increases its gain. Likewise, for every mth sample which 
does overshoot the threshold, each of AGC blocks 971-975 incrementally decreases the 



18 



wo 02/25886 



PCT/USOl/29552 



gain. According to a more specific embodiment, the release function of AGC blocks 
971-975 is given by: 

gain = gain + (gain * release) 

5 

and the attack ftmction of AGC blocks 971-975 is given by: 

gain = gain - (gain * attack) 

10 where "release" and "attack" represent the release aad attack time constants, 
respectively. 

Drive blocks 976-980 are another set of user programmable gain adjustments 
which precede negative attack time limiters (NATLs) 981-985. For some signal 
transients which occur quickly, AGCs 971-975 may not react quickly enough and some 

15 overshooting samples would go otherwise go untreated resulting in a sharp overshoot at 
the beginning of the transient. To deal with this, NATLs 981-985 look at future samples 
and limit the gain of the current sample to avoid the distortion associated with such sharp 
overshoots. The lower the threshold is set, the more "dense" the sound becomes. 

Each of drive blocks 986-990 is the inverse of the corresponding one of drive 

20 blocks 976-980. Each of drive blocks 976-980 works in concert with the corresponding 
one of inverse drive blocks 986-990 to adjust the effective range of operation of the 
corresponding one of NATLs 981-985. In addition, in band 1, e.g., sub-bass, drive block 
986 feeds soft clip block 991 which corresponds to a nonlinear function which 
essentially rounds off the waveform, creating harmonics which create the perception that 
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there is more bass than there is, i.e., within the same peak-to-peak excursion of the input 
signal there is a lot more acoustic energy in the output because of the harmonics. 

Mixer block 992 which has independently controllable gain for each band is 
followed by a final NATL 993 which limits the total peak of the combined bands, e.g., 
5 constructive interference between peaks in different bands may cause peaks which need 
to be dealt with. NATL 993 is followed by Clip block 994 which removes any 
remaining overshoots from the signal. 

Figs. 10a and 10b show another 5-band signal processor 1000 designed according 
to yet another embodiment of tiie invention. This embodimeut of the invention has an 
10 advantage with respect to processor 900 of Figs. 9a and 9b in that it represents a lower 
load on the system's overall processing resources, i.e., it has a lower cycle budget, due to 
a few simpUfications. It should also be noted that, with some exceptions noted below, 
the processing blocks of processor 1000 operate in a similar manner to the corresponding 
blocks of processors 30 and 900 described above. Indeed, as can be seen in Fig. 10a, the 
15 input samples are pre-processed in one of four parallel paths in much the same way (with 
the exception of the band-pass filters) as described above with reference to Fig. 9a. 

The preprocessed audio is then divided into five frequency bands using two 
three-way crossover blocks 1052 and 1054, each having cutoff frequency pairs of 80 and 
400 Hz, and 2 and 8 kHz, respectively (instead of the four crossovers 952-955 in Fig. 
20 9b). hi addition, crossover blocks 1052 and 1054 include independent user 

programmable gain controls which elimmate the need for the subsequent drive blocks in 
other embodiments. The samples in each of Bands 1-5 are then subjected to further 
processing as follows. 

According to a specific embodiment, for every sample received that doesn't 
25 overshoot its threshold, each of AGC blocks 1070-1074 incrementally increases its gain. 

20 



wo 02/25886 



PCT/USOl/29552 



Likewise, for every sample whicli does overshoot the threshold, each of AGC blocks 
1070-1074 incrementally decreases the gain. According to a more specific embodiment, 
the release function of AGC blocks 1070-1074 is given by: 

5 gain = gain + (gain/(2''release)) 

and the attack function of AGC blocks 1070-1074 is given by: 

gain = gain - (gain/(2'^attack)) 

10 

where "release" and "attack" represent the release and attack time constants, 
respectively. 

For some signal transients which occur quickly, AGCs 1070-1074 may not react 
quickly enough and some overshooting samples would go otherwise go untreated 
15 resulting in a sharp overshoot at the beginning of the transient. To deal with this, 
NATLs 1080-1084 look at future samples and hmit the gain of the current sample to 
avoid the distortion associated with such sharp overshoots. 

In addition, in the lowest frequency band, e.g., sub-bass, soft clip block 1090 
corresponds to a nonlinear function which essentially rounds off the waveform, creating 
20 harmonics which create the perception that there is more bass than there is, i.e., within 
the same peak-to-peak excursion of the input signal there is a lot more acoustic energy in 
the output because of the harmonics. 

Mixer block 1091 which has independently controllable gain for each band is 
followed by a final NATL 1092 which lunits the total peak of the combmed bands, e.g., 
25 constructive interference between peaks in different bands may cause peaks which need 
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to be dealt with. NATL 1092 is followed by CHp block 1093 which removes any 

remaining overshoots from the signal. 

Fig. 1 1 shows a 4-band signal processor 1100 designed according to still another 

embodiment of the invention. This embodiment of the invention presents an even lower 
5 load on processing resources than the previously described embodiments due to 

additional simplification. As such, this embodiment is particularly amenable to 

applications in which a fairly sophisticated level of signal processing is desired, but 

which have a paucity of processing resources, e.g., portable digital audio players such as 

MP3 and CD players. It should also be noted that, with some exceptions noted below, 
10 the processing blocks of processor 1 100 operate in a similar manner to the corresponding 

blocks of processors 30, 900, and 1000 described above. 

The received audio samples are divided into four frequency bands using one 

three-way crossover block 11 52 and one two-way crossover block 11 54, having cutoff 

frequencies of 80 and 400 Hz, and 2 kHz, respectively. In addition, crossover blocks 
15 1 152 and 1154 include independent user programmable gain controls which eliminate 

the need for the subsequent drive blocks in other embodiments. 

According to a specific embodiment, for every sample received that doesn't 

overshoot its threshold, each of AGC blocks 1 170-1 173 incrementally increases its gain. 

Likewise, for every sample which does overshoot the threshold, each of AGC blocks 
20 1 170-1 173 incrementally decreases the gain. According to a more specific embodiment, 

the release fimction of AGC blocks 1 170-1 173 is given by: 

gain = gain + (gain/(2^release)) 

25 and the attack fimction of AGC blocks 1 170-1 173 is given by: 
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gain = gain - (gain/(2'^attack)) 

where "release" and "attack" represent the release and attack time constants, 
5 respectively. 

Mixer block 1191 which has independently controllable gain for each band is 
followed by a final NATL 1 192 which limits the total peak of the combined bands, e.g., 
constructive interference between peaks in different bands may cause undesirable peaks 
in the output signal. 

10 Specific applications will now be described with reference to Figs. 12a through 

14. It will be understood that the systems depicted are merely examples of systems 
which would benefit from utilization of various ones of the signal processing techniques 
of the present invention. As described above, there are a great many more applications 
for these techniques contemplated which are within the scope of the present invention. 

15 Recent and ongoing developments in the digital radio industry will eventually 

result in a high-quality digital path from the broadcaster to the consumer which is largely 
devoid of dynamic range limitations and the problematic requirement of pre-emphasis. 
The complete digitization of the audio delivery chain will mean that audio will remain in 
the digital domain for the entire path from the original recording to the consumer while 

20 maintaining its original quality and dynamic range, a feat only previously possible, for 
example, when listening directly to a compact disc player. 

The preservation of vntually all of the audio signal's dynamic range by such 
systems will allow a much wider dynamic range control than previously possible, 
enabling ever more sophisticated processing of the audio signal for artistic and other 

25 purposes. Unfortunately, regardless of the level of processing sophistication, the digital 
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broadcaster cannot currently provide a digital audio signal which is appropriate for every 
listening environment, not to mention for every listener's preference. The best the 
broadcaster can hope to do is to process the audio signal for a particular "signature" 
sound with reference to some normalized "lowest common denominator" listening 
experience. Such an approach severely hmits the djmamic range of tiie deUvered signal, 
often making the listening experience unsatisfactory for a substantial number of listeners. 

Many of the drawbacks of current digital broadcasting schemes relate to the fact 
that the audio processing occurs at the source of the audio signal, i.e., the digital 
broadcaster's radio transmitter, and as a result cannot meet the specific needs of each 
individual listener. Therefore, according to a specific embodiment of the present 
invention, a digital broadcasting system is proposed in which the digital signal 
processing techniques of the present invention are employed to overcome this problem. 
That is, processing capabilities are provided in the radio receiver which will allow 
customization of the listening experience according to each listener's preferences. 

Figs. 12a and 12b are simplified block diagrams of a digital audio broadcasting 
(DAB) station 1200 and a DAB receiver-side system 1250, respectively. Radio station 
1200 receives the program audio signal which may be an analog signal which is 
subsequently converted to a digital signal by A/D converter 1202 or an AES/EBU digital 
signal, one of which is then encoded using the station's codec 1204. The resulting AES 
digital audio signal is then provided to IBOC exciter 1206 which uses it to modulate a 
broadcast RF signal. 

The output AES digital signal is also provided to a signal processor 1208 
designed according to the present invention. According to a more specific embodiment, 
processor 1208 comprises processor 900 of Figs. 9a and 9b. However, it will be 
understood that any of a variety of embodiments of the invention may be used. 

24 
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Processor 1208 is configured by the digital broadcaster via control interface 1210 
to effect a variety of goals including, for example, providing the station's "signature" 
sound. The resulting audio signal may be monitored by the broadcaster's personnel via 
an off air monitor 1212 which receives both a processed AES/EBU digital signal and a 
5 two-channel processed audio signal provided by D/A converter 1214. In this way, the 
broadcaster's desired sound can be achieved. 

Unlike previously described embodiments, processor 1208 does not process the 
digital audio prior to transmission. Listead, low speed digital data representing the 
desired processor configuration are provided to exciter 1206 for transmission on the RF 

10 signal along with the digital audio. These data may then be employed by the Ustener's 
system to configure a corresponding signal processor on the receiver side to process the 
digital audio signal in accordance with the broadcaster's programmed scheme. The 
configuration data set may include any of the parameters for any of the processor blocks, 
and may be less or more inclusive according to the broadcaster's design. 

15 Referring now to Fig. 12b, DAB receiver-side system 1250 includes a DAB 

receiver 1252 and a compact disc (CD) player 1254 each of which may be controlled by 
the user via control circuitry 1256 which may mclude, for example, a remote control (not 
shown). As shown in the figure, the user may select between receiver 1252 and CD 
player 1254 as the audio source. 

20 If the user selects DAB receiver 1252, both the PCM audio data and the low 

speed processor configuration data sent by station 1200 are provided to signal processor 
1258 which, according to a specific embodiment comprises processor 900 of Figs. 9a and 
9b. It will, however, be xmderstood that any of a wide variety of implementations may 
be used. Processor 1258 is configured according to the received low speed data and 

25 processes the digital audio data accordingly. The Hstener may customize the 
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configuration of processor 1258, augmenting or completely overriding the broadcaster's 
default configuration using control interface 1260 which, according to the embodiment 
shown, is also operable to control the system's volume, balance, and fader functions 
represented by block 1262. 

5 Processor 1258 provides the processed digital audio samples to D/A converter 

1264 which, in turn, provides the converted analog signal to volume/balance/fader block 
1262, the output of which is provided to amplifiers 1266-1269 which drive speakers 
1270-1273, respectively. 

In this way, the listening experience provided by the digital broadcasting system 

10 can be customized to conform to each listening environment and according to each 

listener's preference, while retaining some level of control for the baseline experience in 
the hands of the broadcaster. That is, according to various embodiments, the user is 
given the option of selecting the predefined default processing configuration provided by 
the digital broadcaster, altering that configuration in some way, or completely 

15 overriding. -The integration of these capabilities into the listener's system is made 
possible, at least in part, by the fact that the processing techniques of the present 
invention may be implemented with a very small impact on the processing resources 
already available in most such systems. 

In fact, the low impact of the signal processors of the present invention makes 

20 these processor ideal for integration into a wide variety of applications. One such 
application is in a satelhte television system such as the one shown in Fig. 13. As 
represented by boxes 1302, 1304, and 1306, satellite system 1300 employs a variety of 
disparate sources for the content it transmits to customers. This typically results in an 
uneven loudness across different channels and even for different content on a single 

25 channel which is undesirable firom the end user's perspective. 
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This may of course be dealt with by integrating the processing techniques of the 
present invention into the satellite system's headend equipment. However, as discussed 
above with reference to the digital broadcasting context, this only addresses part of the 
problem. It still does not allow for customization of the individual user's hstening 
experience. Therefore, according to the embodiment of the present invention, the 
processing techniques of the present invention are integrated into the user's equipment in 
much the same way as in the digital broadcasting system to provide the desired signal 
processing capabilities. 

Referring again to Fig. 13, different types of content (1302, 1304, and 1306) are 
provided to the headend's satellite uplmk 1308 which may or may not include some level 
of signal processing capability either according to the present invention or some other 
technique. The content is transmitted to satellite 1310 which then transmits the content 
to a user's antenna 13 12 for decoding by a set top box 1 3 1 4 and presentation on 
television 1316. According to one embodiment, a signal processor designed according to 
the present invention (e.g., processor 1 100 of Fig. 1 1) is included in set top box 1314 and 
may be configured according to configuration data transmitted along with the content by 
the satellite provider in a manner similar to that described above with reference to Figs. 
12a and 12b. Alternatively, a default configuration may be provided in the set top box 
itself In either case, the user can either alter or override the default processor 
configuration using, for example, a menu driven interface which is accessed via 
television 1316 and an associated remote control (not shown). It will be understood, of 
course, that the preceding discussion applies equally well to a cable television system. 

According to an alternate embodiment, a signal processor designed according to 
the invention is provided in the television set itself La fact, any system which includes 
audio derived from disparate sources may benefit from the signal processing and 
27 
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normalization capabilities of the present invention. For example, referring now to Fig. 
14, a home entertainment system 1400 may include multiple sources of audio signals 
such as a CD player 1402, an FM radio receiver 1404, and an MP3 player 1406. These 
audio signals may be received by a receiver 1408 which amplifies them using power amp 
5 1410 which drives speakers 1412. As shown, receiver 1408 includes a signal processor 
1414 designed according to the present invention which may be configured to eliminate 
the unevenness resulting firom the differences between the audio sources, and which 
allows the user to customize the listening experience according to his preferences. 

It will be understood that this idea may be further generalized to encompass the 

10 integration of a signal processor designed according to the invention into any electronic 
device or system which employs audio. This may include the types of devices discussed 
above, e.g., televisions, CD and MP3 players, car stereos, radios, etc. It may also include 
recording devices such as video and tape recorders, Mini Disc recorders, etc. The 
techniques of the invention may also be applied to any type of telephony or voice 

1 5 communication system whether over conventional telephone lines, the Internet, or in the 
wireless environment. An example of a multi-band processor for voice applications will 
now be described witli reference to Fig. 15. 

Fig. 15 shows a 3-band signal processor liSOO which may be employed, for 
example, in voice or telephony applications. The input audio is pre-processed by AGC 

20 1501 . The pre-processed audio is then divided into three fi-equency bands using 2-way 
crossover blocks 1502 and 1504 having cutoff frequencies of 1000 Hz and 2000 Hz, 
respectively. This may be accomplished, for example, as described above with reference 
to the multi-band crossover of Fig. 3. The samples m each of Bands 1-3 are then 
subjected to fiarther processmg as follows. 
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Noisegate blocks 1512-1516 remove components of the audio signal that are 
below a certain level of amplitude. Delay blocks 1518-1522 are used by noisegate 
blocks 1512-1516 for look-ahead/negative attack time. Drive blocks 1518-1522 
represent user programmable gain adjustments which umformly exaggerate the received 
signal component as it goes into the following AGC block (i.e., 1524-1528) which works 
to reduce changes in the gain. According to a specific embodiment, for every nth sample 
that doesn't overshoot its threshold, each of AGC blocks 1524-1528 incrementally 
increases its gain. Likewise, for every mth sample which does overshoot the threshold, 
each of AGC blocks 1524-1528 incrementally decreases the gain. According to various 
embodiments, the release function of AGC blocks 1524-1528 may correspond to any of 
the functions described above. 

Drive blocks 1530-1534 are another set of user programmable gain adjustments 
which precede negative attack time limiters (NATLs) 1536-1540. For some signal 
transients which occur quickly, AGCs 1524-1528 may not react quickly enough and 
some overshooting samples would go otherwise go untreated resulting in a sharp 
overshoot at the beginning of the transient. To deal with tiiis, NATLs 1536-1540 look at 
future samples and limit the gain of the current sample to avoid the distortion associated 
with such sharp overshoots. The lower the threshold is set, the more "dense" the sound 
becomes. 

Each of drive blocks 1542-1546 is the inverse of the corresponding one of drive 
blocks 1530-1534, each of which works in concert with the corresponding one of inverse 
drive blocks to adjust the effective range of operation of the corresponding one of 
NATLs. Mixer block 1548 which has independently controllable gain for each band is 
followed by a final NATL 1550 which limits the total peak of the combined bands, e.g., 
constructive interference between peaks in different bands may cause peaks which need 

29 
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to be dealt with. NATL 1550 is foUowed by CUp block 1552 which removes any 
remaining overshoots from the signal. 

The maimer in which the signal processing techniques of the present invention 
facihtate the bandwidth reduction of an audio encoding scheme such as MPS encoding 
5 relates to yet another set of embodiments. According to these embodiments, the benefits 
of the invention may be realized even without real-time application of the associated 
signal processing techniques to the digital audio. That is, any sequence of digital audio 
samples may be processed using a signal processor designed according to the present 
invention to generate audio files to be stored for playback at a later time. 

10 For example, a provider of MP3 files to be downloaded over the Internet is not in 

a position to provide the same real-time processing as a provider of streaming audio. 
Nevertheless, the benefits of the present invention may be enjoyed by the provider and 
the user of such downloaded files even if the user does not have the signal processing 
capabilities of the present mvention. That is, the provider of the MP3 files can apply the 

15 signal processing techniques of any of the embodiments of the present invention to any 
MP3 files, and then stbre the processed MP3 files for serving to users over the hitemet. 
The files may then be downloaded and played using any of the available 
decoders/players, and the listening experience will be very much the same as if the 
processing techniques of the invention were being apphed in real tune. The 

20 preprocessing can be for any of the desired effects described above witli reference to the 
various embodiments of the invention such as, for example, mitigating tiie undesirable 
artifacts of a low bit rate codec or providing a "signatare" sound for the provider of the 
audio files. 

Another example of a sitiiation m which the benefits of the present invention may 
25 be enjoyed without the real-time processmg of the audio samples is the production and 
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distribution of recording media, e.g., compact discs, having audio files stored therein 
which have been prq)rocessed according to the present invention. That is, the 
manufacturer or distributor of audio CDs can preprocess the audio to be distributed on a 
CD for any of the purposes described above, e.g., providing a default sound for a 
5 particular type of music. 

While Ihe invention has been particularly shown and described with reference to 
specific embodiments thereof, it will be understood by those skilled in the art that 
changes in the form and details of the disclosed embodiments maybe made without 
departing from the spirit or scope of the invention. That is, tiie basic building blocks of 

10 the specific configurations described, e.g., AGCs, negative attack time limiters, and drive 
blocks, may be combined in a wide variety of ways to provide highly efficient multi- 
band signal processing for a similarly wide variety of applications. Factors such as 
desired fidelity, available transmission bandwidth, and available processing overhead 
may interact to dictate different optimal configurations for different applications. 

15 Additionally, various embodiments have been described herein with reference to 

implementation in software. However, it will be understood that the basic signal 
pYocessing blocks of such embodiments may be implemented in other ways and remain 
within the scope of the invention. For example, these processing blocks may be 
implemented in application specific integrated circuits (ASICs) or programmable logic 

20 devices (PLDs). Hardware implementations of the processing blocks of the present 
invention are also possible. 

Moreover, specific processor configurations have been described herein with 
reference to specific applications, e.g., streaming audio over the Internet, portable 
playback devices, set top boxes for cable and satellite television. It should be noted, 

25 however, that the configurations described above are not limited to corresponding 
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applications. Rather, any of the described processors may be configured and deployed 
for any of a wide variety of applications including any of the applications described. 

In addition, although various advantages, aspects, and objects of the present 
invention have been discussed herein with reference to various embodunents, it will be 
5 understood that the scope of the invention should not be limited by reference to such 
advantages, aq)ects, and objects. Rather, the scope of the invention should be 
detennined with reference to the appended claims. 
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WHAT IS CLAIMED IS : 

1 . At least one computer readable medium having computer program 

5 instructions stored therein for effecting multi-band processing of an original sampled 
signal, comprising: 

first instructions for separating the original sampled signal into a plurality of 
signal components each corresponding to one of a plurality of jfrequency bands; 

second instructions for independently and dynamically controlling a dynamic 
10 range associated with each one of the plurality of signal components; 

third instructions for modifying at least one signal level associated with the 
plurahty of signal components; and 

fourth instructions for combining the signal components into a processed sampled 

signal. 

15 

2. The at least one computer readable medium of claim 1 wherein the first 
instructions separate the original sampled signal into one of 3, 4, and 5 overlapping 
frequency bands. 

20 3 . The at least one computer readable medium of claim 1 wherein the second 

instructions effect nonhnear control of a gain factor associated with each of the signal 
components. 



25 



4. The at least one computer readable medium of claim 1 wherein the second 
instructions control the dynamic range associated with each of the signal components by 
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applying a gain factor to each sample of each of the signal components, the gain factor 
being dynamically adjusted. 

5. The at least one computer readable medium of claim 4 wherein the gain 
factor for each of the signal components is dynamically adjusted every first number of 
samples. 

6. The at least one computer readable medium of claim 5 wherein the first 
number comprises 64. 

7. The at least one computer readable medium of claim 4 wherein the gain 
factor for each of the signal components is dynamically adjusted with reference to a 
threshold level to which each sample of each of the signal components is compared. 

8. The at least one computer readable medium of claim 7 wherein the gain 
factor is adjusted upward using a release rate parameter where each sample is below the 
threshold level, and downward using an attack rate parameter where each sample is 
above the threshold level. 

9. The at least one computer readable medium of claim 1 wherein the tliird 
instructions limit the at least one signal level with reference to a first number of fiiture 
samples. 
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1 0. The at least one computer readable medium of claim 9 wherein a gain 
factor to be applied to a current sample is modified with reference to at least one of the 
future samples. 

1 1 . The at least one computer readable medium of claim 1 0 wherein the gain 
factor is decreased when appHcation of the gain factor to the at least one future sample 
results in the at least one future sample exceeding a threshold, and wherein the gain 
factor is decreased after the gain factor has been applied to the first number of current 



12. The at least one computer readable medium of claim 1 wherein the third 
instructions implement an independent negative attack time limiter for application to 
each of the plurality of signal components. 

15 13. The at least one computer readable medium of claim 1 wherein the third 

instructions implement a negative attack time limiter for application to the processed 
sampled signal. 

14. The at least one computer readable medium of claim 1 further comprising 
20 fifth instructions for applying at least one preset gain factor to at least one of tlie 

processed sampled signal and the plurality of signal components. 

1 5 . The at least one computer readable medium of claim 1 4 wherein the at 
least one preset gain factor comprises a pluraHty of preset gain factors, each preset gain 

25 factor corresponding to one of the plurahty of signal components. 
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1 6. The at least one computer readable medium of claim 1 5 wherein multiple 
ones of the plurality of preset gain factors correspond to each of the plurality of signal 
components. 

17. The at least one computer readable medium of claim 1 6 wherein a first 
one of the multiple ones of the pluraUty of preset gain factors for a corresponding one of 
the plurality of signal components is an inverse of second one of the multiple ones of the 
plurality of preset gain factors for the corresponding one of the plurality of signal 
components. 

1 8. The at least one computer readable medium of claim 1 7 wherein the first 
and second preset gain factors are applied to the corresponding signal component before 
and after, respectively, processing of the corresponding signal components by either of 
the second and third instructions. 

19. The at least one computer readable medium of claim 14 wherein the at 
least one preset gain factor comprises a first preset gain factor for applying to the 
processed sampled signal. 

20. The at least one computer readable medium of claim 1 wherein the fnst 
instructions implement at least one two-way crossover for separating the original 
sampled signal into the plurality of signal components. 
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2 1 . The at least one computer readable medium of claim 1 wherein the first 
instructions implement at least one three-way crossover for separating the original 
sampled signal into the plurality of signal components. 



5 22. The at least one computer readable medixmi of claim 1 wherein the first 

instructions correspond to four two-way crossover blocks for separating the original 
sampled signal into five signal components each corresponding to one of five firequency 
bands, wherein the second instructions correspond to five automatic gain control (AGC) 
blocks for independently and dynamically controlling the dynamic range associated with 

10 each one of the signal components, wherein the third instructions correspond to five 
negative attack time limiter (NATL) blocks for limiting the signal level associated with 
each of the signal components, the at least one computer readable medium fiirther 
comprising fifth instructions for applying a predetermined gain to each of the signal 
components prior to processing by corresponding ones of the NATL blocks, and sixth 

1 5 instructions for applying an inverse of the predetermmed gain to each of the signal 
components after processing by corresponding ones of the NATL blocks. 



23. The at least one computer readable medium of claun 1 wherein the first 
instructions correspond to two three-way crossover blocks for separating tlie original 

20 sampled signal into five signal components each corresponding to one of five frequency 
bands, wherein the second instructions correspond to five automatic gain control (AGC) 
blocks for independently and dynamically controlling the dynamic range associated with 
each one of the signal components, and wherein the third instructions correspond to five 
negative attack tune limiter (NATL) blocks for hmiting the signal level associated with 

25 each of the signal components. 
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24. The at least one computer readable medium of claim 1 wherein the first 
instructions correspond to a two-way crossover block and a three-way crossover block 
for separating the original sampled signal into four signal components each 

5 corresponding to one of four frequency bands, wherein the second instructions 
correspond to four automatic gain control (AGC) blocks for independently and 
dynamically controlling the dynamic range associated with each one of the signal 
components, wherein the fourth instructions correspond to a mixing block for combining 
the signal components into a mixed sampled signal, and wherein Ihe third instructions 

10 correspond to a negative attack time limiter (NATL) block for limiting a signal level 
associated with the mixed sampled signal. 



25. A system for transmitting the processed sampled signal of claim 1 
comprising the at least one computer readable medium of claim I. 

15 

26. The system of claim 25 comprising a server platform in a wide area 
network. 

27. The system of claim 25 comprising a digital radio transmission platform. 

20 

28. The system of claim 25 comprising a cellular communication system 
transmission platform. 

29. The system of claim 25 comprising a cable television transmission 
25 platform. 
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30. The system of claim 25 comprising a satellite television transmission 
platfonn. 

5 3 1 . A system for receiving the original sampled signal of claim 1 comprising 

the at least one computer readable medium of claim 1. 

32. The system of claun 3 1 comprising a client platfonn in a wide area 
network. 

10 

33 . The system of claim 3 1 comprisuig a digital radio receiver. 

34. The system of claim 3 1 comprising a portable cellular communication 

device. 

15 

3 5 . The system of claun 3 1 comprising a cable television decoder. 

36. The system of claim 31 comprising a satellite television decoder. 

20 37. A portable device comprising the at least one computer readable mediimi 

of claim 1. 

38. The portable device of claim 37 wherein the original sampled signal 
represents an audio signal and the portable device comprises a digital audio player. 
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39. The portable device of claim 38 wherein the digital audio player 
comprises a compact disc player. 



40. The portable device of claim 38 wherein the digital audio player 
5 comprises an MP3 player. 

41 . A computer implemented method for effecting multi-band processing of 
an original sampled signal, compiisiag: 

separating the original sampled signal into a plurality of signal components each 
1 0 corresponding to one of a plurality of frequency bands; 

independently and dynamically controlling a dynamic range associated with each 
one of the plurality of signal components; 

limiting at least one signal level associated with the plurality of signal 
components; and 

15 combining the signal components into a processed sampled signal. 

42. The computer implemented method of claim 41 as implemented in a wide 
area network having a server platform from which the original sampled signal originates 
and a client platform. 

20 

43 . The computer implemented method of claim 42 wherein flie separating, 
controlling, limiting, and combining occur on the server platform. 
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44. The computer implemented method of claim 42 wherein the sep arating, 
controlling, Ihnituig, and combining occur on the client platform. 
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45 . The computer implemented method of claim 41 fiirther comprising 
encoding the processed sampled signal into a compressed file format. 

46. The computer implemented method of claim 45 wherein the compressed 
file format is MP3. 

47. A method for providing data files which comprise encoded versions of 
processed files resulting from the multi-band processing of claim 41. 

48. The method of claim 47 wherein providing the data files comprises 
transmitting the data files in a wide area network. 

49. The method of claim 47 wherein providing the data files comprises 
providing at least one computer readable medium having the data files stored therein. 

50. The method of claim 47 wherein providing the data files comprises 
transmittmg the data files using a transmitter of electromagnetic waves. 

51. A computer readable medium having a data file stored therein 
representing the processed sampled signal generated using the computer implemented 
method of claim 41 . 

52. An apparatus for effecting multi-band processing of an original sampled 
signal, comprising: 
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means for separating the original sampled signal into a plurality of signal 
components each corresponding to one of a plurality of frequency bands; 

means for independently and dynamically controlling a dynamic range associated 
with each one of the plurality of signal components; 

means for limiting at least one signal level associated with the plurality of signal 
components; and 

means for combining the signal components into a processed sampled signal. 

53. A signal processor for effecting multi-band processing of an origmal 
sampled signal, comprising: 

at least one first processing block for separating the original sampled signal into a 
plurality of signal components each corresponding to one of a plurality of frequency 
bands; 

a plurality of second processmg blocks for independently and dynamically 
controlling a dynamic range associated with each one of the plurality of signal 
components; 

at least one third processing block for limiting at least one signal level associated 
with the plurality of signal components; and 

at least one fourth processing block for combining the signal components into a 
processed sampled signal. 

54. A signal processor for effecting multi-band processing of an origmal 
sampled signal, comprising: 

four two-way crossover blocks for separating the original sampled signal into five 
signal components each corresponding to one of five frequency bands; 
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five automatic gain control (AGC) blocks for independently and dynamically 
controlling a dynamic range associated with each one of the signal components; 

five negative attack time limiter (NATL) blocks for limiting a signal level 
associated with each of the signal components; 

five first drive blocks for applying a predetermined gain to each of the signal 
components prior to processing by corresponding ones of the NATL blocks; 

five second drive blocks for applying an inverse of the predetermined gain to 
each of the signal components after processmg by corresponding ones of the NATL 
blocks; and 

a mixing block for combining the signal components into a processed sampled 

signal. 

55. A signal processor for effecting multi-band processing of an original 
sampled signal, comprising: 

two three-way crossover blocks for separating the original sampled signal into 
five signal components each corresponding to one of five firequency bands; 

five automatic gain control (AGC) blocks for independently and dynamically 
controlling a dynamic range associated with each one of the signal components; 

five negative attack time limiter (NATL) blocks for limiting a signal level 
associated with each of the signal components; and 

a mixing block for combining the signal components into a processed sampled 

signal. 



56. A signal processor for effecting multi-band processing of an original 
sampled signal, comprising: 
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a two-way crossover block and a three-way crossover block for separating the 
original sampled signal into four signal components each corresponding to one of four 
frequency bands; 

four automatic gain control (AGC) blocks for independently and dynamically 
controlling a dynamic range associated with each one of the signal components; 

a mixing block for combining the signal components into a mixed sampled 
signal; and 

a negative attack time limiter (NATL) block for limiting a signal level associated 
with the mixed sampled signal. 

57. A signal processor for effecting multi-band processing of an original 
sampled signal, comprising: 

two two-way crossover blocks for separating the original sampled signal into 
three signal components each corresponding to one of three frequency bands; 

three automatic gain control (AGC) blocks for independently and dynamically 
controlling a dynamic range associated with each one of the signal components; 

three negative attack tune limiter (NATL) blocks for limiting a signal level 
associated with each of the signal components; 

three first drive blocks for applying a predetermined gain to each of the signal 
components prior to processing by corresponding ones of the NATL blocks; 

three second drive blocks for applying an inverse of the predeteimined gain to 
each of the signal components after processing by corresponding ones of the NATL 
blocks; and 

a mixing block for combining the signal components into a processed sampled 

signal. 
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r Do for each sample 




Read next sample. 
Compute weighted sum of previous filter 
outputs and new sample. Low pass output is 
the weighted sum. High pass output is the 
difference between input sample and low 
pass output. 
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FIG. 3 




Do for each sample 



For each 64th sample, increase gain factor 
slightly. This is done by multiplying by a 
number slightly greater than 1 .0 
(The release rate parameter) 

Multiply each input sample by 
gain factor 

If result is greater than threshold, reduce gain 
factor slightly. This is done by multiplying by 
a number slightly less than 1 .0 
(The attack rate parameter) 

I 
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Do for each sample 




For each 64th Sample Increase gain factor slightly. 
This is done by multiplying by a number slightly greater 
than 1.0 
(the release rate parameter) 








Do trial multiplication: 
Input sample times gain factor. If trial result is greater 
than threshold, reduce gain factor sightly. This is done 
by multiplying by a number slightly less than 1 .0 
(the attack rate parameter) 
Apply nonlinear function to gain factor. Final output is 
result of multiplying input sample by modified gain 
factor. 
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